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GDP/capita correlations are investigated in various time windows (TW), for the time interval 
1990-2005. The target group of countries is the set of 25 EU members, 15 till 2004 plus the 10 coun- 
tries which joined EU later on. The TW-means of the statistical correlation coefficients are taken as 
the weights (links) of a fully connected network having the countries as nodes. Thereafter we define 
and introduce the overlapping index of weighted network nodes. A cluster structure of EU countries 
is derived from the statistically relevant eigenvalues and eigenvectors of the adjacency matrix. This 
may be considered to yield some information about the structure, stability and evolution of the EU 
country clusters in a macroeconomic sense. 
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I. INTRODUCTION 

It is of major interest in economy to extract as much as 
possible information from the sparse and noisy macroe- 
conomic (ME) time series. Most ME indicators have a 
yearly or at most quarterly frequency. When a ME in- 
dicator time series has been produced for a very long 
time, strong evidence against stationarity alas arises 
Therefore the correlation patterns are usually investi- 
gated by moving a constant size time window (TW) with 
a constant step so that the whole time interval is scanned, 
somewhat averaging the correlations. 

The goal of the present paper is to investigate the 
weighted fully connected network of the N = 25 coun- 
tries forming the European Union in 2005 (EU-25). The 
ties between countries are supposed to be proportional 
to the degree of similitude of the macroeconomic fluctua- 
tions referring to the GDP /capita annual rates of growth 
between 1990 and 2005. The countries are abbreviated 
according to The Roots Web Surname List (RSL) [2] 
which uses 3 letters standardized abbreviations to des- 
ignate countries and other regional locations. The World 
Bank database Q is here used as data source instead of 
the Penn World Tables Q on which some data is miss- 
ing for several East-European countries. In this way, the 
investigated time span goes from 1990 to 2005. 

The system is represented by an evolving network, 
nodes being the countries; links are weights (or 
GDP/capita fluctuations). In order to extract structures 
from the network, we average the time correlations in 
different windows, i.e. we assume that one can consider 
"average degrees" , etc. The matrix-based method reveals 
the emergence of a number of " common factors" , through 



the main eigenvectors (Kaiser criterion and Cattel scree 
test). The usual expectations defined by politicians or 
economists through geographical connexions [!, H, 0] are 
observed. 

Moreover we introduce the country overlapping hierar- 
chy index . Further study would be of interest in order to 
see its role within the network global evolution as a func- 
tion of hierarchy levels through correlations with other 
indices of studies. 



II. DATA AND METHDOLOGY 

The here below investigated ME indicators are 
the GDP/capita annual growth rates. Indeed, the 
GDP/capita is expected to reflect to the largest extent 
what A. Smith called, over two centuries ago, "the wealth 
of nations" . In fact, it is expected to account both for the 
economic development and for the people well being. The 
target group of countries is composed of M = 25 coun- 
tries: the 15 members of the European Union in 2004 
(EU-15) and the 10 countries which joined the European 
community in 2005 (EU-10). 



Correlation coefficient 



The correlation coefficients CV, between two ME time 
series Xi and yj, i,j = 1,...,N, is calculated in the 
present work according to the (Pearson's) classical for- 
mula: 



Cij(t,T) 



< x l y J > - < x l >< y-j > 
< x\- < Xi > 2 >< y?- < y 3 > 2 > 
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Each dj is clearly a function both of the time win- 
dow size T and of the initial time (i.e. the "position" of 
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the constant size time window on the scanned time in- 
terval). One has to note (or recall) that the correlation 
coefficients are not additive, i.e. an average of correlation 
coefficients in a number of samples does not represent an 
"average correlation" in all those samples. In cases when 
one needs to average correlations, the CV,- 's first have to 
be converted into additive measures. For example, one 
may square the dj 's before averaging, to obtain the so 
called coefficients of determination (Cfj) which are addi- 
tive, or one can convert the 's into so-called Fisher z 
values, which are also additive 0. The former approach 
is used here below, so that the average correlations are 
calculated as: 



k+T 



n 1/2 



t=k 



,fc = 0,l,...,JV-T, (2) 



where N is the total number of points (the time span), 
T is the time window size used for the analysis, v = 
N — T + 1 and t is a discrete counter variable. 

The weight of the connection between i and j reflects 
the strength of correlations between the two agents; it 
can be simply expressed as: 
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w l3 (T) = C h3 (T) 
fulfilling the obvious relations: < Wij 



< 1 



and 



= f for i = j. 



(3) 



Thus an adjacency matrix with elements Wij(T) can 
be defined. The cumulative distribution function (CDF) 
of the weights is given in Fig. 1 for different (4) time 
windows. One can see a remarkable sharpening of the 
CDF as a function of decreasing TW. 

These changes of the distribution shape can also be 
pointed out through the kurtosis (K) variation with the 
time window size (Fig. 2). For the Gaussian distribution 
Kq = 0, while for the discrete uniform distribution of m 
data (m = 300 here) it can be calculated [8| as: 



Kr 



6 m 



5 m — 1 



(4) 



It is found on Fig. 2 that the K value shifts between 
the limit Kq and Ku indeed. 



B. Weighted network characteristics 

Networks are characterized by various parameters. For 
instance, the vertex degree is the total number of vertex 
connections. It may be generalised in a weighted network 

[inn nim hies as: 



M 

E 

i=i 



w, 



(5) 



FIG. 1: The cumulative distribution function (CDF) of the 
weights set w%j for four different time window sizes. 



Thus, the average degree in the network is: 



< k >= 



1 M M 

hT,T, 



(6) 



i=i j=i 



as shown on Fig. 3. Notice a cubic root law behavior. 

Another characteristic quantity describes the number 
of triangles in the network indicating some correlations. 
In the literature, there have been several ways to evalu- 
ate assortative correlations, such as the assortativity co- 
efficient introduced by Newman [fl| that is the Pearson 
correlation coefficient of the degrees at either ends of an 
edge. Nonetheless, all of them focus on local degree- 
correlations between two connected nodes. 

Here below we introduce an overlapping index Oij 
in order to indicate/find some hierarchy of clusters on a 
network. 

First consider a non- weighted network consisting of M 
vertices; let the number Nij measure the common num- 
ber of neighbours of i and j. We impose that Oij must 
satisfy the following properties: 

(1) Oij = Nij = (fully disconnected, or "tree- 
like" network); 

(2) Oij = 1 Vi =/= j in a fully connected network, where 
Nij = M — 2 ; ki = kj = M - 1; 

(3) < Oij < I, otherwise; 

{ki + kj)/2 



(4) O i:j oc Nij 



and Oij oc < k >ijz 
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FIG. 2: The kurtosis of the weights set Wij versus the time 
window size. Inset: the double logarithm of the Wij's prob- 
ability density function for 5 years time window size. The 
thick line has a ±2 slope, corresponding to the Gaussian dis- 
tribution. 



FIG. 3: The average degree < k > and the variance of the 
weights set Wij in the EU-25 network versus the time window 
size. Variance is normalised to its maximal value. Inset: < 
k > versus T in log-plot for emphasizing the inverse cubic 
root law. 



A quantity satisfying all these conditions (l)-(4) can be TABLE I: Average overlapping index of each EU-25 country 
denned as: in decreasing order; 2 decimals have been used only 



Nijiki + kj) 



(7) 



2(M-l)(M-2) 
For a weighted network, Eq.(7) may be generalised as: 



M 



°ij = A ^2( W U + w 3l) 
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(8) 



SWE 0.38 SVK 0.37 AUT 0.35 POL 0.34 LTU 0.32 
GER 0.37 BEL 0.36 FIN 0.35 MLT 0.33 LVA 0.31 
FRA 0.37 IRL 0.36 PRT 0.35 GRC 0.33 CZE 0.31 
DNK 0.37 LUX 0.36 NLD 0.35 CYP 0.32 EST 0.30 
HUN 0.37 ESP 0.35 ITA 0.35 SVN 0.32 GBR 0.29 



where A = 1/[2(M - 1)(M - 2)] and i ^ j. 

One can easily see that < < 1, and Ojj = 1 
only for all = 1, i.e. fully connected non- weighted 
network. However, for a weighted network, Oij can never 
be zero. 

Each overlapping index is thus computed for each EU- 
25 country using the adjacency matrix defined in Eq. 
3. A country average overlapping index < Oi > can be 
next assigned to each country, i.e. dividing the sum of 
its overlapping indices by its number of neighbours: 



1 M 

<a >= mTiS^ 



(9) 



The results are shown in Table I. Due to lack of space, 
the reader is invited to draw some interesting conclusion 
by mere observation. 
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FIG. 4: The six largest eigenvalues (EVZ; Z = 1,6) of the 
adjacency matrix [wij] for the EU-25 weighted network versus 
the moving time window size. The eigenvalues are normalized 
to the correlation matrix size (M = 25), thus the vertical axis 
may be read as a fractional contribution to the total variance. 

C. Clusters 

Finally let us question whether clusters exist. This is 
done through the study of the eigenvalues and eigenvec- 
tors of the correlation matrix defined here above. Let us 
recall firstly that the eigenvalues can be interpreted as 
the proportion of variance explained by each canonical 
correlation relating two sets of variables. There will be 
as many eigenvalues as there are canonical correlations 
(roots), and each successive eigenvalue will be smaller 
than the last since each successive root will explain less 
and less of the data. In factor analysis, the eigenvectors of 
a correlation matrix correspond to factors, and eigenval- 
ues to factor loadings. The observable random variables 
are modeled as linear combinations of the factors, plus 
the error terms. 

Having a measure of how much variance each succes- 
sive factor extracts, one can call the question of how 
many factors to retain. By its nature this is somehow an 
arbitrary decision. However, there are some guidelines 
that are commonly used, and that, in practice, seem to 
yield the best results. Firstly, we can retain only factors 
with eigenvalues greater than 1. In essence this is like 
saying that, unless a factor extracts at least as much as 
the equivalent of one original variable, one has to drop 
it. This criterion, firstly proposed by Kaiser [l5| . is prob- 



FIG. 5: The cluster-like structure of the EU-25 countries ac- 
cording to the GDP/capita rates of growth. The country 
coordinates are the corresponding eigenvector components of 
the EU-25 weighted network adjacency matrix [wij]. 



ably the one most widely used. A graphical method is 
the scree test first proposed by Cattell [HI]. Plotting 
the eigenvalues in a simple line plot, one has to find the 
place where the smooth decrease of eigenvalues appears 
to level off to the right of the plot. To the right of this 
point, presumably, one finds only factorial scree (scree is 
the geological term referring to the debris which collects 
on the lower part of a rocky slope). Both criteria have 
been studied in detail [H, [H, [TlJ • By generating random 
data based on a particular number of factors it was found 
that the first method (Kaiser criterion) sometimes retains 
too many factors, while the second technique (scree test) 
sometimes retains too few; However, both methods were 
found remarkably convergent when the number of com- 
mon factors is not too large [l7|. The above consider- 
ations explain why the first eigenvectors (i.e. the ones 
corresponding to the largest eigenvalues) are generally 
considered as carrying the useful information. 

The six largest eigenvalues (EVZ; Z = 1,6) of the ad- 
jacency matrix [Wij] for the EU-25 weighted network are 
shown as a function of the moving time window size on 
Fig. 4. For the display the eigenvalues are normalized to 
the correlation matrix size (M = 25), thus the vertical 
axis may be read as a fractional contribution to the to- 
tal variance. "Obviously", only the first two eigenvalues 
seem to be of major interest. Thus a clustering scheme of 
the EU-25 countries can be constructed on the structure 
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FIG. 6: The EU-25 weighted network for the weights greater 
than the threshold value wt = 0.49 



FIG. 8: The EU-25 weighted network for the weights greater 
than the threshold value wt = 0.81 



of the corresponding eigenvectors; the EU-25 countries 
are positioned with respect to these vectors according to 
their coordinates in Fig. 5. 
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FIG. 7: The EU-25 weighted network for the weights greater 
than the threshold value wt = 0.69 



III. CONCLUSION 

Complex networks have become an active field of re- 
search in physics [1, [HI E2, EH, 0, [H| • These systems are 
usually composed of a large number of internal compo- 
nents (the nodes and links), and describe a wide variety 
of systems of high intellectual and technological impor- 
tance. Relevant questions pertain to the critical dynam- 
ics of properties, not only on the network, but also for 
the network structure itself. The occurrence of struc- 
tural (" order-disorder" ) transitions has been rarely stud- 
ied. Here we have reported on the structure of a 25 node 
cluster made of the EU countries linked together through 
weighted links representing the correlations between the 
GDP /capita. We have average those links in various time 
windows. For such weighted networks one can define a 
" hierarchy overlapping coefficient" . Clusters in the net- 
work appear when filtering the weights through this co- 
efficient. 

Thus the adjacency matrix (eigenvalues, eigenvectors) 
with the Kaiser criterion allows to well observe net- 
works/clusters of countries. Notice the strong relation- 
ship between the average overlapping index (Table 1) hi- 
erarchy and the position of the countries derived from 
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the correlation matrix main eigenvalues/vectors. 

Finally for any politically-economically minded ob- 
server we show (Figs. 6-8) the EU-25 network at var- 
ious weight levels, i.e. eliminating links below various 
threshold values. 

In summary, some quantitative way to describe the 
structure, hierarchy and evolution of the EU countries in 
a macroeconomic sense has been presented. 
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